A PHASE TRANSITION FOR THE LIMITING SPECTRAL 
DENSITY OF RANDOM MATRICES 



OLGA FRIESEN AND MATTHIAS LOWE 



Abstract. We analyze the spectral distribution of symmetric random matrices 
with correlated entries. While we assume that the diagonals of these random 
matrices are stochastically independent, the elements of the diagonals are taken 
to be correlated. Depending on the strength of correlation the limiting spectral 
distribution is either the famous semicircle law or some other law, related to that 
derived for Toeplitz matrices by Bryc, Dembo and Jiang (2006). 



1. Introduction 

Historically, the theory of random matrices is fed by two sources. They were intro- 
duced in mathematical statistics by the seminal work of Wishart |Wis28] . On the 
other hand, Wigner used random matrices as a toy model for the energy levels and 
excitation spectra of heavy nuclei |Wig58 . From these two roots random matrix 



theory has grown into an independent mathematical theory with applications in 
many areas of science. 

A central role in the study of random matrices with growing dimension is played by 
their eigenvalues. To introduce them let, for any n G N, {a n (p, q), 1 < p < q < n} 
be a real valued random field. Define the symmetric random n x n matrix X n by 

X n (q,p) = X n (p, q) = -^j=a n (p,q), 1 < p < q < n. 

>n 



We will denote the (real) eigenvalues of X„ by A^ < A^ < . . . Xn ■ Let /x„ be the 
empirical eigenvalue distribution, i.e. 



n 

1 . 



1 n 

E5 („). 



n 

k=l 



Wigner proved in his fundamental work |Wig58| that, if the entries a n (p,q) are 
independent, normally distributed with mean 0, and have variance 1 for off-diagonal 
elements, and variance 2 on the diagonal, the empirical eigenvalue distribution fi n 
converges weakly (in probability) to the so called semicircle distribution (or law), 
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i.e. the probability distribution v on K with density 

v{dx) = - x 2 t\ x \< 2 . 

Quite some effort has been spent in investigating the universality of this result. 
Arnold |Arn71j showed that the convergence to the semicircle law is also true if one 
replaces the Gaussian distributed random variables by independent and identically 
distributed (i.i.d.) random variables with a finite fourth moment. Also the identical 
distribution may be replaced by some other assumptions (see e.g. |Erdllj ). Recently, 
it was observed by Erdos et al. ( |ESY09j ) that the convergence of the spectral 
measure towards the semicircle law holds in a local sense. More precisely, it can be 
proved that on intervals with width going to zero sufficiently slowly, the empirical 
eigenvalue distribution still converges to the semicircle distribution. 
This result therefore interpolates between the global and the local behavior of the 
eigenvalues in the bulk of the spectrum, which was rather recently proved to be 
universal as well in the so-called "four-moment-theorem" ( |TVllj ). 
Other generalizations of Wigners semicircle law concern matrix ensembles with en- 
tries drawn according to weighted Haar measures on classical (e.g., orthogonal, uni- 
tary, symplectic) groups. Such results are particularly interesting since such random 
matrices also play a major role in non-commutative probability (see e.g. |Gui09] . or 
the very recommendable book Anderson, Guionnet, and Zeitouni |AGZ10j ). 
A slightly different approach to universality was taken in |SSB05j and |FLllj . In 
|FLllj we study matrices with correlated entries. It is shown that, if the diagonals 
of X n are independent and the correlation between elements along a diagonal decays 
sufficiently quickly, again the limiting spectral distribution is the semicircle law. 
Universality, however, does have its limitations. As was shown by Bryc et al. 
[BD J06J the limiting spectral distribution of large random Toeplitz or Hankel matri- 
ces is not the semicircle law. In fact, not much is known about the limiting measures, 
apart from their moments (which are the result of the proof by a moment method, 
a technique, that will also be employed by the present paper). 
The present note tries to explore the borderline between the weak correlations stud- 
ied in |FL11] and the strong correlations that lead to a limiting spectral distribution 
that is not of Wigner type. We will again assume that X n has independent diag- 
onals and we will see, which quantity determines whether the limiting measure of 
the empirical eigenvalue distribution is a semicircle law or not. A particularly nice 
example is borrowed from statistical mechanics. There the Curie- Weiss model is the 
easiest model of a ferromagnet. Here a magnetic substance has little atoms that 
carry a magnetic spin, that is either +1 or —1. These spins interact in cooperative 
way, the strength of the interaction being triggered by a parameter, the so-called 
inverse temperature. The model exhibits phase transition from paramagnetic to 
magnetic behavior (the standard reference for the Curie- Weiss model is jE1185j ). We 
will see that this phase transition can be recovered on the level of the limiting spec- 
tral distribution of random matrices, if we fill their diagonals independently with the 
spins of Curie- Weiss models. For small interaction parameter, this limiting spectral 
distribution is the semicircle law, while for a large interaction parameter we obtain 
a distribution similar to the Toeplitz case. 



A PHASE TRANSITION IN RANDOM MATRIX THEORY 



3 



The rest of this paper is organized as follows. Section 2 contains the technical 
assumptions we have to make together with the statement of our main result. Section 
3 characterizes the various limiting distributions we obtain. Section 4 contains some 
interesting examples, while Section 5 is devoted to the proof of the main theorem. 



2. Main Result 

This section contains the general theorem that describes the various limiting spec- 
tral distributions for the matrices X n introduced above. In order to be able to state 
the theorem we will have to impose the following conditions on X n : 



(CI) E [a n (p, q)} = 0, E [a n (p, q) 2 ] = 1 and 



rrik := sup max E 

neN 1 <P<Q< n 



\a n (p,q)\ l 



< oo, k G N. 



(2.1) 



(C2) the diagonals of X n , i.e. the families {a n (p,p + r), 1 < p < n — r}, < r < 

n — 1, are independent, 
(C3) the covariance of two entries on the same diagonal depends only on n, i.e. 

for any < r < n — 1 and 1 < p,q < n — r, p ^ q, we can define 

Cov (a n (p, p + r),a n (q,q + r)) =: c n , 
(C4) the limit c := lim^oo c n exists. 



With these notations and conditions we are able to formulate the central result of 
this note. 

Theorem 2.1. Assume that the symmetric random matrix X n as defined above 
satisfies the conditions (CI), (C2), (C3) and (C4)- Then, with probability 1, the 
empirical spectral distribution fi n of X n converges weakly to a nonrandom probability 
distribution u c which does not depend on the distribution of the entries of X n . 



3. The Limiting Distribution v r 



Since the proof of Theorem 2.1 relies on the so-called moment-method, we want 
to describe the limiting spectral distribution v c in terms of its moments. It is not 
surprising that v c is some combination of the semicircle distribution and the limiting 
distribution of Toeplitz matrices as described in |BDJ06j . Indeed, c = covers 
the case of independent entries implying that v§ is the semicircle law. On the 
other hand, considering symmetric Toeplitz matrices, we have c = 1, and thus v\ 
is the corresponding limiting distribution we want to introduce in the following (cf. 
[BDJ06J). Therefore, we have to start with some notation. For any even k G N, 
let VV(k) denote the set of all pair partitions ir of {1, . . . , k}. If i and j are in the 
same block of tt, we also write i ~ n j. The measure z/ x can be defined with the help 
of Toeplitz volumes. Thus, we associate to any partition 7r G W(k) the following 
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system of equations in unknowns x , . . . ,x^. 

x x - x + x h - x h -i = 0, if 1 ~^ Zi, 
x 2 - x\ + x h - xi 2 -i = 0, if 2 ~ n / 2; 



- ffi-i + 2^ - = 0, it Z ~tt fj, 



x fe - + xu - xu-i = 0, if k Z 



fe- 



Since tt is a pair partition, we in fact have only k/2 equations although we have listed 
k. However, we have k + 1 variables. If n = {{h,ji}, • • • , {ik/2, jk/2}} with i\ < j% for 
any 1 = 1,..., k/2, we solve (3.1 ) for x^, . . . , Xj k/2 , and leave the remaining variables 



undetermined. We further impose the condition that all variables Xq, . . . ,Xk lie in 
the interval I = [0, 1]. Solving the equations above in this way determines a cross 
section of the cube / fc / 2+1 . The volume of this will be denoted by pt(tt). 
Returning to the measure u%, we can use the results in [BDJ06J to see that all odd 
moments of v\ are zero, and for any even fceN, the fc-th moment is given by 



J x k dvi{x) = Pt(tt 



The expression above is bounded by (k — 1)!!. Hence, Carleman's condition is 
satisfied implying that the distribution v\ is uniquely determined by its moments. 
Moreover, it has an unbounded support as verified in |BDJ06] . To describe v c for 
general c G K, we need a further definition which was introduced in [BDJ06J to 
analyze Markov matrices. 

Definition 3.1. Let k £ N be even, and fix tt £ VV(k). The height h(7r) of tt is 
the number of elements i ~ 7r j , i < j , such that either j = i + l or the restriction of 
ii to {i + 1, . . . ,j — 1} is a pair partition. 

Note that the property that the restriction of tt to {i+l, ... ,j — 1} is a pair partition 
in particular requires that the distance j — i — 1 > 1 is even. To give an example how 
to calculate the height of a partition, take n = {{1, 6}, {2, 4}, {3, 5}}. Considering 
the block {1,6}, we see that the restriction of n to {2,3,4,5} is a pair partition, 
namely {{2, 4}, {3, 5}}. However, this is not true for both remaining blocks. Hence, 
h(ir) = 1. 

In Section |5j we will see that all odd moments of v c vanish, and the even moments 
are given by 

j x k dis c (x) = Ch + PT(ir)c^ h{n) = PA^- h{n \ (3.2) 

TreCVV(k) TreVV(k) 

where = k \^+i)\ denotes the fc-th Catalan number, and CW{k) is the set of 
crossing pair partitions of {1, ... , k}. Here, we say that a pair partition 7r is crossing 
if there are indices i < j < I < m with % ~ 7r I and j ~ n m. Otherwise, we call tt non- 
crossing. We will denote the set of all non-crossing pair partitions of {1, . . . ,k} by 
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MVV(k\ Note that the number of elements in J\fVV(k) coincides with the Catalan 
number Ck/2- The latter is exactly the fc-th moment of the semicircle distribution. 
As for the limiting distribution in the Toeplitz case, we can verify the Carleman 
condition to see that v c is uniquely determined by its moments. 

4. Examples 

In this section, we want to give some examples of processes satisfying the assump- 
tions of Theorem 12.11 

4.1. Toeplitz Matrices. Consider a symmetric Toeplitz matrix. The limiting spec- 



tral distribution calculated in [BDJ06J can be deduced from Theorem 2.1 as well. 
Indeed, assuming that the entries are centered with unit variance and have existing 
moments of any order, we see that all conditions are satisfied with c = c n = 1. Thus, 
we get 

Ck + Pt{^) = yj PtC 71 ")) if k is even, 

X k dV\ (x) = ^ vreCPP(fc) 7reVV{k) 



0, if k is odd. 

4.2. Exchangeable Random Variables. Suppose that for any n G N, we have a 
family {x n (p), 1 < p < n} of exchangeable random variables, i.e. the distribution of 
the vector (x n (l), . . . , x n (n)) is the same as that of (x n (a(l)), . . . , x n (a(n))) for any 
permutation a of {1, ... , n}. In this case, we can conclude that for any 1 < p < q < 
n, we have 

Gov(x n (p),x n (q)) = Cov(a; n (l),x n (2)) =: c n . 

Now assume that c n — > c G R as n — > oo. Define for any n G N, r e {0, . . . , n — 1}, 
the process {a n (p,p + r), 1 < p < n — r} to be an independent copy of {x n (p), 1 < 



p < n — r}. Then, all conditions of Theorem 2.1 are satisfied if we ensure that 
the moment condition (CI) holds. The resulting limiting distribution for different 
choices of c is depicted in Figure [TJ 

An example for a process with exchangeable variables is the Curie- Weiss model with 
inverse temperature (3 > 0. Here, the vector x n = (x n (l), . . . , x n {n)) takes values in 
{—1, l} n , and for any u = (w(l), • • • , w(n)) G { — 1, 1}™, we have 

/ / „ \ 2 



1 




where Z n ^ is the normalizing constant. Since P(x„(l) = —1) = P(x n (l) = 1) = |, 
we obtain E[a; ri (l)] = 0. Further, we clearly have E[x n (l) 2 ] = 1. It remains to 
determine c = lim^oo c n . Therefore, we want to make use of the identity 

Ti 1 

c n = Cov(x n (l),x n (2)) = E[x n (lK(2)] = — -E[m 2 n ] - ^-j, 

where m n := - Y^7=i x n(i) is the so-called magnetization of the system. Since \m n \ < 
1, we see that m n is uniformly integrable. Thus, m n converges in Jzf 2 to some random 
variable m if and only if m n — > m in probability. In |EN78j . it was verified that 
m n — > in probability if (3 < 1, and m n — > m with m ~ \&m(fi) + |^-m(,9) f° r some 
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(a) c = 0.25 (b) c = 0.5 

0.4 n 




-3-2-1 1 2 3 
(c) C = 0.75 



Figure 1. Histograms of the empirical spectral distribution of 100 
realizations of 1000 x 1000 matrices X 10 oo with standard Gaussian 
entries. 



m(/3) > if (3 > 1. The function m(f3) is monotonically increasing on (1, oo), and 
satisfies m(/3) — > as (3 \ 1 and m((3) — > 1 as /3 — > oo. We now obtain 

JO, if/3<l, 
C "J™ C "" jm(/3) 2 , if/3>l. 

Thus, the limiting spectral distribution of X n is the semicircle law if (3 < 1, and 
approximately the Toeplitz limit if (3 is large. This is insofar not surprising as the 
different sites in the Curie-Weiss model show little interaction, i.e. behave almost 
independently, if the temperature is high, or, in other words, (3 is small. However, 
if the temperature is low, i.e. (3 is large, the magnetization of the sites strongly 
depends on each other. The phase transition at the critical inverse temperature 
(3 = 1 in the Curie- Weiss model is thus reflected in the limiting spectral distribution 
of X n as well. 



5. Proof of Theorem 12.11 

The main technique we want to apply is the method of moments. The idea is 
to first determine the weak limit of the expected empirical spectral distribution. 
Therefore, the similar structure of the matrices under consideration allows us to 
repeat some concepts presented in |FLllj . However, we need to develop new ideas 
when calculating the expectations of the entries. 
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5.1. The expected empirical spectral distribution. To determine the limit of 
the k-th moment of the expected empirical spectral distribution //„ of X n , we write 



E 



x k djji n (x) 



iE[tr(X£)] 



1 U 

— - E ^[a(pi,P2)a( y p 2 ,P3)---a( y pk~i,Pk)a(pk,Pi)]- 



fc +1 

n2 P i,...,Pk=i 



The main task is now to compute the expectations on the right hand side. How- 
ever, we have to face the problem that some of the entries involved are indepen- 
dent and some are not. To be more precise, a(pi, q±), . . . , a(pj, qj) are indepen- 
dent whenever they can be found on different diagonals of X n , i.e. the distances 
\pi — qi\, . . . , \pj — qj\ are distinct. Hence, a first step in our proof is to consider the 
expectation E [a(pi, ^2)^(^2, Pz) • • • a (Pk-i, Pk) a {Pki Pi)]i an d to identify entries with 
the same distance of their indices. Therefore, we want to adapt some concepts of 
[SSB05] and |BDJ06] to our situation. 

To start with, fix k G N, and define T n {k) to be the set of A;-tuples of consistent 
pairs, that is multi-indices (Pi, . . . , Pk) satisfying for any j = 1, . . . , k, 



(i) Pj = (pj,qj) G {l,...,n} 2 , 

(ii) qj = Pj+i, where k + 1 is cyclically identified with 1. 

With this notation, we find that 

^E [tr (Xj)] =4-: E E K( Pl ) ■ • • a ^ Pfe )] ' 
n (P 1 ,...,p k )eT n (k) 

To reflect the dependency structure among the entries a n {P\) . . . a n (Pfc), we want to 
make use of the set V(k) of partitions of {1, ... , k}. Thus, take 7r e V(k). We say 
that an element (Pi, . . . ,Pk) G T n {k) is a 7r- consistent sequence if 

\Pi - ft I = \Pj ~ Qj\ i ~n j- 

According to condition (C2), this implies that ^(PjJ, . . . , a n (P it ) are stochastically 
independent if ii, . . . ,ii belong to I different blocks of it. The set of all 7r-consistent 
sequences (Pi, ...,P k ) G T n {k) is denoted by S n (ir). Note that the sets S n (ir), 
7r G V(k), are pairwise disjoint, and UyrePO) ^ n ( 7r ) = Tn{k). Consequently, we can 
write 

-E [tr (X*)] =-4- E E E [ a ™( p ) • • • a ^ Pk ^ ■ ( 5 - X ) 
n TreP(fc) (Pi,...,P fc )eSnW 



In a next step, we want to exclude partitions that do not contribute to (5.1) as 
n — > 00. These are those partitions satisfying either #7r > | or #7r < |, where #7r 
denotes the number of blocks of n. We want to treat the two cases separately. 

First case: #7r > |. Since n is a partition of {1, ... , k}, there is at least one single- 
ton, i.e. a block containing only one element i. Consequently, a n (Pi) is independent 
of {a n (Pj),j 7^ i} if (Pi,...,Pfc) G S n (n). Since we assumed the entries to be 
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centered, we obtain 

E[On(Pi) 

This yields 



■ ■ a n (P k )] = E [ H a n (P,)J E [a n (P z )] = 0. 



712 



+1 



(Pi 



■ ,Pk)&S n (n) 



0. 



Second case: r := ^7r < |. Here, we want to argue that n gives vanishing contri- 



bution to (5.1 ) as n — > oo by calculating #S n (ir). To fix an element (Pi, . . . , P k ) G 
S n (7i), we first choose the pair Pi = (pi,qi). There are at most n possibilities to 
assign a value to pi, and another n possibilities for q x . To fix P 2 = G°2,<?2), note 
that the consistency of the pairs implies P2 = q±. If now 1 ~„. 2, the condition 
\Pi ~ qi\ — \P2 — q2\ allows at most two choices for q 2 . Otherwise, if 1 7^ 2, we have 
at most n possibilities. We now proceed sequentially to determine the remaining 
pairs. When arriving at some index i, we check whether i is in the same block as 
some preceding index 1, ... ,i — 1. If this is the case, then we have at most two 
choices for Pi and otherwise, we have n. Since there are exactly r = #tt different 
blocks, we can conclude that 



#S„(tt) < n 2 n r - l 2 k - r < C n r+1 
with a constant C = C(r, k) depending on r and k. 



(5.2) 



Now the uniform boundedness of the moments (2.1) and the Holder inequality to- 
gether imply that for any sequence (Pi, 



Pk) 



|E K(Pi) • • • a n {P k 



< 



E 1 0„ (Pi 



E|a n (P fc ) 



< m k . 



(5.3) 



Consequently, taking account of the relation r < |, we get 



k 



1 



(-Pi, 



J2 \E[a n (Pi)---a n (P k ) 



< C < C 



722 



+1 



fe 

«2" 



0(1). 



Combining the calculations in the first and the second case, we can conclude that 

ii 

n 



E[tr(X*)] = - 1 J- E nan(Pi) ■ ■ ■ a n (P k )} + o(l) 



Trev(k), (p 1 ,...,p fc )eS„(7r) 



Now assume that k is odd. Then the condition #7r = | cannot be satisfied, and the 
considerations above immediately yield 



lim -E [tr (X£)] = 0. 



n— >oo fl 



It remains to determine the even moments. Thus, let k e N be even. Recall that we 
denoted by VV(k) C V(k) the set of all pair partitions of {1, ... , k}. In particular, 
#tt = § for any tt G VV{k). On the other hand, if #7r = f but tt £ PP(A;), 
we can conclude that 7r has at least one singleton and hence, as in the first case 
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above, the expectation corresponding to the 7r-consistent sequences will become 
zero. Consequently, 

-E[tr(x£)] =4-7 E E E[a n (Pi)---a n (P k )]+o(l). (5.4) 

nevv(k) (p u ...,p h )eS„W 

We have now reduced the original set V(k) to the subset VV(k). Next we want to 
fix a 7r e VV(k) and concentrate on the set S n (n). The following lemma will help 
us to calculate that part of (5.4) which involves non-crossing partitions. 

Lemma 5.1 (cf. |BDJ06j . Proposition 4.4.). Let S*(7r) C S n (ir) denote the set of 
7r- consistent sequences (Pi, . . . , P&) satisfying 

i~Trj Qi ~Pi = Pj - qj 

for all i j . Then, we have 

#(^)\s;(vr)) = o(V+i' 



Proof. We call a pair (Pj, Pj) with i ~ 7r j, i ^ j, positive if q% — Pi = qj — Pj > 
and negative if qi — Pi = qj — Pj < 0. Since J2i=i Qi ~ Pi = by consistency, the 
existence of a negative pair implies the existence of a positive one. Thus, we can 
assume that any (Pi,...,Pk) G S n (n)\S^(ii) contains a positive pair (Pi,P m ). To 
fix such a sequence, we first determine the positions of I and m, and then fix the 
signs of the remaining differences qi—pi. The number of possibilities to accomplish 
this depends only on k and not on n. Now we choose one of n possible values for 
Pi, and continue with assigning values to the differences \q%—Pi\ for all Pj except for 

k 

Pi and P m . Since 7r is a pair partition, we have at most n^- possibilities for that. 
Then, Yli=i qi ~ Pi = ® implies that 

0<2{qi-pi) = qi-pi + q m -p m = E Pi ~ 

ie{x,...,k}, 

Since we have already chosen the signs of the differences \qi — Pi\, % ^ l,m, as well 
as their absolute values, we know the value of the sum on the right hand side. 
Hence, the difference qi — Pi = q m — p m is fixed. We now have the index p h all 
differences \qi—pi\,i G {1, . . . , k}, and their signs. Thus, we can start at Pi and 
go systematically through the whole sequence (Pi, . . . , Pk) to see that it is uniquely 
determined. Consequently, our considerations lead to 

#(S n (n)\S:(n))<Cnl = o . 

□ 

A consequence of Lemma |5.1| and relation ( |5.3[ ) is the identity 

[tr (X*)] = -4- E E E[a n (Pi)---a n (P k )]+o(l). (5.5) 

n£VV(k) (P u ...,P k )eS*(n) 

As already mentioned, the sets S*(tt) help us to deal with the set J\fVV(k) of non- 
crossing pair partitions. 
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Lemma 5.2. Let tt G MVV{k). For any (P 1( . . . , P fe ) G S*(ir), we have 

EMPO-^Pk)] = 1. 

Proof. Let / < m with / ~ 7r m. Since 7r is non-crossing, the number I — m — 1 of 
elements between / and m must be even. In particular, there is I < i < j < m with 
i j and j = i + 1. By the properties of S*(tt), we have a n (p) = a n (Pj), and 
the sequence (Pi, . . . , Pi, . . . , P_i, Pi+2, ■ ■ ■ , P m , ■ ■ ■ , Pt) is still consistent. Applying 
this argument successively, all pairs between / and m vanish and we see that the 
sequence (Pi, . . . , p, P m , . . . , Pj.) is consistent, that is qi = p m . Then, the identity 
pi = q m also holds. In particular, a n (p) = a n (P m ). Since this argument applies for 
arbitrary I ~ 7r m, we obtain 

E [a n {P 1 ) ■ ■ ■ a n {P k )} = E K{Pi)a n {P m )) = 1- 

Z<m, 

□ 

By Lemma [5. 2[ we can conclude that 

4^1 E E E K(P0 • • • a n (P fc )] = ^ £ #^(tt). 

77,2 TreJVVV(k) (Pi,...,P fc )e5*(vr) 77,2 ireJVVV(k) 

The following lemma allows us to finally calculate the term on the right hand side. 
Lemma 5.3. For any tt G MW(k), we have 

Ihn = 1. 



Proof. Since 7r is non-crossing, we can find a nearest neighbor pair i z + 1. Now 
fix (Pi,...,pt) G S£(7r), and write p = (pi,Pi+i), I = l,...,k, where + 1 is 
identified with 1. Then the properties of S£(7r) ensure that (p«,Pi+i) = (pi+2,Pi+i)- 
Hence, we can eliminate the pairs p, P i+ i to obtain a sequence (P x , . . . , P^} 2 ) '■ = 
(Pi, . . . , Pj_i, P+2, • • • , Pfc) which is still consistent. Denote by tt' the partition ob- 
tained from tt by deleting the block {i, i + 1}, and relabeling any I > i + 2 to I — 2. 

bmce 7r is non-crossing, we have tt' G MVV(k - 2). Moreover, (P^, . . . , P, {1) ^ 



k-2J 



S*(tt'). Thus we see that any (Pi, . . . , P k ) G S*(tt) can be reconstructed from a tuple 
(Pi"^, • • • , Pk-2) e S*(tt') and a choice of Pi+i- The latter admits 71 — possibilities 
since {i, % + 1} forms a block on its own in 71. Consequently, 

#5£(tt) #S* n (TT' 



+ 0(1). (5.6) 

f!2 + 712 



#S*(tt) 



Now if k = 2, we get S*(tt) = {((p,q), (q,p)) : p, g G {1, . . . ,n}}, implying 
1. For arbitrary even fceN, the statement of Lemma 5.3 follows then by induction 
using the identity in ( 5.6[ ). □ 
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Taking account of the relation #AfW(k) = Cu, we now arrive at 
1. 



n 



E[tr (X*)] 

= Ch + -£- E ^[a n (Pi)---a n (P k )]+o(l), (5.7) 

neCVV{k) (P u ...,P k )eS*(n) 

with CVV(k) being the set of all crossing pair partitions of {1, . . . , fc}. Since we 
consider only pair partitions, we know that the expectation on the right hand side 
is of the form 

E [a n (pi, qi)a n (pi + n, q 1 + n)] ■ ■ ■ E [a n (p r , q r )a n (p r + r r , q r + r r )] , 

for r := | and some choices of p\, qi, T\, . . . ,p r , q r , T r G N. In order to calculate 
this expectation, assumption (C3) indicates that we only need to distinguish for 
any i = l,...,k, whether we have 7$ = or not. In the first case, we get the 
identity E [a n (pi, qi)a>n(Pi + Ti, qi + Ti)} = 1, in the second we can conclude that 
E [a n (pi, qi)a n (pi + Ti, qi + Ti)} = c n . Fix some pair partition n G W(k), and take 
(Pi, . . . , Pfc) G S*(tt). Motivated by these considerations, we put 

m (Pi, . . . , P k ) := #{1 <i<j<k: a n (P t ) = a n (P,)}. 
Obviously, we have < m (Pi, . . . , P fc ) < |. With this notation, we find that 

fc/2 



4^ E E [^( p i) • • • = E ^"'M w > ( 5 - 8 ) 

n2 (^.....JikjeS'W n2 «=o 

where 

A« (vr) := {(Pi, . . . , P k ) G S* n (7r) : m (Pi, . . . , P k ) = I}. 

The following lemma states that if a pair Pj, Pj contributes to m(Pi, . . . , P^), then 
we can assume that the block {i,j} in n is not crossed by any other block. 

Lemma 5.4. Let n G VV(k) and fix i ~ n j } i < j . Define 

Snfchj) '■= {(Pi,---,Pk) e ^(tt) : Pi = (pi,qd,Pj = (Pj,Qj),Pi = qj,q% = Pj}- 

Assume that there is some i' ~ 7r j' such that i < i! < j , and either j' < i or j < j' . 
Then, 

#S;(7r;*,j)=o(nf +1 ). 



Proof. To fix some (Pi, . . . , Pk) G S*(n; i,j), we first choose a value for pi = qj and 
qi = pj. This allows for at most n 2 possibilities. Hence, Pj and Pj are fixed. Now 
consider the pairs P«+i, . . . , Pj'-i- Pi+i is uniquely determined by consistency. For 
qi+i, there are at most n choices. Then, p i+2 = If i + 2 ~ 7r i + 1, we have one 
choice for g»+2- Otherwise, there are at most n. Proceeding in the same way, we see 
that we have n possibilities whenever we start a new equivalence class. Similarly, 
we can assign values to the pairs Pj, . . . , Pj'+i in this order. Now is determined 
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by consistency. When fixing p_i, . . . , Pi, Pk, ■ ■ ■ , Pj+i, we again have n choices for 
any new equivalence class. To sum up, we are left with at most 



n n 2 



n 2 



possible values for an element in S*(Tr;i,j). 



□ 



Recall Definition 3.1 
partition ir. Lemma 
with 



where we introduced the notion of the height h(ir) of a pair 
5.4 in particular implies that only those (Pi, . . . , Pk) G S*(ir) 



<m(P 1 ,...,P k ) < h(n) 



contribute to the limit of (5.8). Indeed, if m{P\, . . . , Pk) > h{n), we can find some 
i j, i < j, such that (Pi, . . . , P k ) G S^(ii; i, j) and neither j — i + 1 nor is the 
restriction of ir to {i + 1, . . . ,j — 1} a pair partition. Hence, the crossing property 
in Lemma 5.4 is satisfied, and (Pi, . . . , Pk) is contained in a set that is negligible in 
the limit. The identity in (5.8) thus becomes 



-+i 



1 ^ 

E MP,) ■ ■ ■ a n (P k )} = cI l #B«\n) + o(l 

(P 1 ,...,P fc )e5i( 7 r) n2 1=0 



where 



P«(tt) := {(Pi, ... , P fe ) G 5;(tt) : m (Pi, . . . , P fc ) = I; 

o-n{Pi) = a n(Pj),i < j =^ j = i + I or 7i~\{i+i,...j-i} is a pair partition} . 



In the next step, we want to simplify the expression above further by showing that 
whenever < / < h(ir). This is ensured by 



Lemma 5.5. Let tt G Wik). For any (Pi, . . . ,P}~) G S*(tt), we have 

m(P 1 ,...,P k ) > h(n). 



Proof. If h(ir) = 0, there is nothing to prove. Thus, suppose that h(7r) > 1 and take 
some i ^ j, i < j, such that either j = i + 1 or j — i — 1 > 2 is even and the restriction 
of 7r to {i + 1, . . . ,j — 1} is a pair partition. Fix (Pi, . . . , Pk) G ^(vr), and write 
P — (PhPi+i) f° r ari Y l = 1, ■ ■ ■ ,k. We need to verify that pi + i = pj. If we achieve 
this, the definition of ^(vr) will also ensure that pi = Pj+x- As a consequence, the 
7r-block {i, j} will contribute to m{Pi, . . . , Pk)- Since there are h(n) such blocks, we 
will obtain m(Pi, . . . , Pk) > /i(vr) for any choice of (Pi, . . . , Pfc) G ^(ff). 
If j = i + we immediately obtain p i+1 = pj. To show this property in the 
second case, note that the sequence (P+i, . . . ,Pj-i) solves the following system of 
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equations: 

Pi+2 ~ Pi+1 + Ph+i -Ph = 0, if i + 1 h, 
Pi+3 - Pi+2 + Ph+i - Ph = °) if i + 2 ~tt k, 

Pi+m+i - Pi+m + Pi m +i ~Pl m = 0, ifi + m ~ w l m , 

Pj - Pj-i + Pij-i-i+i - Ph-i-i =0, if j — 1 ~„ 
Start with solving the first equation for p i+2 which yields 

Pi+2 = Pi+i - Ph+i +Ph- 
Then, insert this in the second equation, and solve it for pi + 3 to obtain 

Pi+3 = Pi+l ~ Ph+i + Ph ~ Ph+i + Ph- 

In the j — i — 1-th step, we substitute = pj + Q_j_x) in the j — i — 1-th equation, 
and solve it for pj = We then have 

j-i-l 

pj = p i+1 - (pu+i -pij- 

m=l 

Since the restriction of 7r to {i + 1, . . . ,j — 1} is a pair partition, we can conclude 
that the sets {/i, . . . , and {i + 1, . . . ,j — 1} are equal. Hence, we obtain 

E^/k+i - PiJ = Pj - Pi+i, implying Pj = p i+1 . 

□ 



With the help of Lemma 5.5, we thus arrive at 



£ E K(P0 ■ ■ ■ a n (P fe )] = cT*™ + 



k+1 Z / L '" v " v '" /J £+1 

n2 (P t) ... ) fl k )6SS(»r) 712 



Note that any element (Pi, . . . , Pf.) G 5£(7r) satisfying the condition 

a>n(Pi) = a n(Pj), i < j =>- j = * + 1 or 7r | is a pair partition, (5.9) 



fulfills the condition m(Pi, . . . , P&) = h(ir) as well. Indeed, (5.9) guarantees that 



m(Pi, . . . , Pfe) < /i(vr), and Lemma 5.5 ensures that m(Pi, . . . , Pk) > h(ir). Thus, 
we can write 

B^\n) = {(P u ...,P k )eS:(n): 

a>n(Pi) — o, n (Pj),i < j => j = i + 1 or 7r|{j+i, ...,j-iy is a pair partition} . 

Now any element in the complement of Pn (-7r) satisfies for some i ^ w j the 



crossing assumption in Lemma |5.4[ This yields 

= o(l). 
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Since B { n h{n) \n) U (B£ iir)) (tc)Y = S*{ir), we obtain that 



1 



n 2 



+i 



£ E[a n (P 1 )---a n (P k )} 



+ o(l). (5.10) 



H2 



(ft ) ...,P fc )eS*( 7 r) 

To calculate the limit on the right-hand side, we have 

Lemma 5.6 (cf. [BDJ06], Lemma 4.6). For any it G VV(k), it holds that 



lim 



k 

712' 



Pr(vr), 



where Pt(^) is the Toeplitz volume defined by solving the system of equations (3.1) 



Proof. Fix 7T G PP(fc). Note that if P = {(pi,p i+1 ),i = l,...,k} G S*(ir), then 
xo, xi, . . . , %k with Xi = Pi+i/n is a solution of the system of equations (3.1 ). On the 
other hand, if x , X\, . . . ,Xk G {1/n, 2/n, . . . , 1} is a solution of (3.1 ) and p i+ i = nx iy 
then either {{pi,p i+ i),i = 1, . . . , k} G 5;(tt) or {(pi,p l+ i), i = l,...,fc} G 5 n (7/) for 
some partition rj G "P(A;) such that i ^ j =>■ z ~ ?? j, but < #7r. 



In (3.1), we have fc + 1 variables and only k/2 equations. Denote the k/2 + 1 un- 



determined variables by y\, . . . ,yk/2+i- We thus need to assign values from the set 
{1/n, 2/n, . . . , 1} to yi, . . . , yjfe/2+i! and then to calculate the remaining k/2 vari- 
ables from the equations. Since the latter are also supposed to be in the range 
{1/n, 2/n, . . . , 1}, it might happen that not all values for the undetermined vari- 
ables are admissible. Let p n (n) denote the admissible fraction of the n k / 2+l choices 



for yi, . . . , 2/&/2+1- By our remark at the beginning of the proof and estimate (5.2) 
we have that 



lim 



712 



+1 



lim p n (ir) 



if the limits exist. Now we can interpret y±, . . . , y k/2+1 as independent random vari- 
ables with a uniform distribution on {1/n, 2/n, . . . , 1}. Then, p n {^) is the probability 
that the computed values stay within the interval (0,1]. As n — > 00, yi, . . . ,y k/2+1 
converge in law to independent random variables uniformly distributed on [0,1]. 
Hence, p n (vr) — > Pt(k). □ 



Applying Lemma 5.6 and assumption (C4) to equation (5.10), we arrive at 



lim 



J2 E [a n {Pi) ■ ■ ■ a n (P k )\ = Pt(k)c2 



l-fc(ir) 



(P U -,Pk)&S*(n) 



Substituting this result in (5.7), we find that for any even k G N, 



lim -E [tr (X*)l =C k + V 



Pt{tt)c2 



-h(n) 



TT<=CVV(k) 



To obtain the alternative expression in (3.2) for the even moments of the limit 



ing measure v c , note that the considerations above were not restricted to crossing 
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partitions. In particular, we can start from identity (5.5) instead of (5.7) to see that 
lim -E [tr (X*)l = lim V * S * M 



k 

r 2 ' 
( ri 



-h(7T) 



Tvevv(k) 



712 



Trevv(k) 



5.2. Almost Sure Convergence. The almost sure convergence of the empirical 
distribution is a consequence of the following concentration inequality proven in 
[BD,T06| and jFLTT] . 

Lemma 5.7. Suppose that conditions (CI) and (C2) hold. Then, for any fc,nGN, 



E 



(tr(X*)-E[tr(Xj)]) 4 



<Cn 2 . 



From Lemma 5/7 and Chebyshev's inequality, we can now conclude that for any 
e > and any k, n e N, 



P 



1 



n 



tr (Xj) - E 



tr (Xj) 



g 



Applying the Borel-Cantelli lemma, we see that 



7? 



tr (Xi) - E 



1 



7? 



tr (Xj) 



—7- 0, a.s.. 



(5.11] 



Let Y be a random variable distributed according to v c . The convergence of the 



moments of the expected empirical distributions and relation (5.11) yield 

1 



?? 



tr(X9^E[y fc ], a.s.. 



Since the distribution of Y is uniquely determined by its moments, we obtain almost 
sure weak convergence of the empirical spectral distribution of X n to v c . 
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